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Captures of compact objects (COs) by massive black holes (MBHs) in galactic nuclei will be an 
important source for LISA, the proposed space-based gravitational wave (GW) detector. However, 
a large fraction of captures will not be individually resolvable — either because they are too distant, 
have unfavorable orientation, or have too many years to go before final plunge — and so will constitute 
a source of "confusion noise," obscuring other types of sources. In this paper we estimate the 
shape and overall magnitude of the GW background energy spectrum generated by CO captures. 
This energy spectrum immediately translates to a spectral density S'j^^^{f) for the amplitude of 
capture-generated GWs registered by LISA. The overall magnitude of S'j^^^{f) is linear in the CO 
capture rates, which are rather uncertain; therefore we present results for a plausible range of 
rates. S'f^^^{f) includes the contributions from both resolvable and unresolvable captures, and thus 
represents an upper limit on the confusion noise level. We then estimate what fraction of 5JJ^^'(/) 
is due to unresolvable sources and hence constitutes confusion noise. We find that almost all of the 
contribution to S'f^^^{f) coming from white-dwarf and neutron-star captures, and at least ~ 30% 
of the contribution from black hole captures, is from sources that cannot be individually resolved. 
Nevertheless, we show that the impact of capture confusion noise on the total LISA noise curve 
ranges from insignificant to modest, depending on the rates. Capture rates at the high end of 
estimated ranges would raise LISA's overall (effective) noise level [fSf{f )Y^^ by at most a factor 
~ 2 in the frequency range 1 — 10 mHz, where LISA is most sensitive. While this slightly elevated 
noise level would somewhat decrease LISA's sensitivity to other classes of sources, we argue that, 
overall, this would be a pleasant problem for LISA to have: It would also imply that detection rates 
for CO captures were at nearly their maximum possible levels (given LISA's baseline design and the 
level of confusion noise from galactic white-dwarf binaries) . 

This paper also contains, as intermediate steps, several results that should be useful in further 
studies of LISA capture sources, including (i) a calculation of the total GW energy output from 
generic inspirals of COs into Kerr MBHs, (ii) an approximate GW energy spectrum for a typical 
capture, and (iii) an estimate showing that in the population of detected capture sources, roughly 
half the white dwarfs and a third of the neutron stars will be detected when they still have > 10 
years to go before final plunge. 
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I. INTRODUCTION 



Captures of stellar-mass compact objects (COs) by massive (~ IO^Mq) black holes (MBHs) in galactic nuclei will 
be an important source for LISA, the proposed space-based gravitational-wave (GW) detector. Recent estimates 
predict that LISA will detect hundreds to thousands of such captures over its projected 3-5 year mission lifetime, 
with captures of ~ \QMq black holes (BHs) dominating the detection rate [1]. In this paper, we point out that 
most captures of white dwarfs (WDs) and neutron stars (NSs), and a substantial fraction of BH captures, will not 
be individually resolvable, and hence will constitute a source of "confusion noise," obscuring other types of sources. 
We estimate the shape and overall magnitude of the GW energy spectrum due to captures, from which we derive the 
spectral density S'^^'^{f) for the amplitude of capture-generated GWs registered by LISA. We then estimate what 
fraction of S'f^^^{f) comes from unresolvable sources, and so represents confusion noise. 

The plan of this paper is as follows. In Sec. II we briefly review what is known about the astronomical capture 
rates. In Sec. Ill we calculate the total GW energy released by a single capture and estimate its spectrum. From 
this, we abstract a model "average" capture spectrum (where the average is over the orbit's final eccentricity and the 
inclination angle t between the MBH's spin and the CO's orbital angular momentum), for a given MBH mass. We 
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then further average this model spectrum over MBH mass (weighted by the capture rate) to estimate the shape of 
the GW energy spectrum from all capture sources. (In practice, we shall only calculate the spectrum in the frequency 
interval most relevant to us: 1 — 10 mHz, near the bottom of the LISA noise curve.) 

In Sec. IV we combine this spectral shape with rate estimates to produce Si^^^^{f), the spectrum of GWs registered 
by LISA due to all capture sources in the universe. We calculate iS)j'*'''(/) in two passes: First we make a crude 
estimate, using a flat-spacetime cosmology and no source evolution, except for a sharp cut-off at early times. Then 
we incorporate cosmological effects and some simple guesses as to the source evolution, leading to results that differ 
little from our initial flat-spacetime estimate. The overall magnitude of 5^^'^^(/) is linear in the capture rates; these 
are rather uncertain, and so we consider a range of rates taken from the literature. 

In Sec. V we address the important issue of source subtraction: We ask how much of the capture background 
calculated in Sec. IV can be "fitted out" (by subtracting individual contributions from the brightest sources). The 
GWs from sources that cannot be individually resolved, we regard as confusion noise. After discussing some general 
scaling rules between source rates, confusion noise, and detection rates, we argue that most of the WD and NS 
contributions to the capture background, and at least ~ 30% of the BH contribution, is unresolvable. We shall see 
that much of the confusion noise comes from sources that have ^ 10 — 200 years to go before final plunge. Because 
the orbits of such sources are still highly eccentric, they radiate a substantial amount of energy into the LISA band 
near perihelion passage; yet, their overall signal-to-noise (SNR) output is too low for LISA to detect them in a 3-yr 
integration time. We refer to these capture sources with ;> 10 years yet to live as "holding-pattern objects" (HPOs), to 
distinguish them from COs now approaching their "terminal descent." Of course, the nearest HPOs will be detectable, 
and HPOs should account for roughly half the WD-capture detection rate — which was not previously realized. 

Finally, in Sec. VI we estimate the effect of capture confusion noise on the total effective LISA noise level 
{f^h^if)Y^'^- (This is somewhat complicated by the fact that capture confusion noise does not simply add in quadra- 
ture to the other noise sources. Roughly, this is because, in the crucial 2 — 5 mHz band near the floor of the total 
noise curve, LISA's effective noise level is dominated by "imperfectly subtracted" galactic binaries, which effectively 
reduce the bandwidth available for carrying information about other types of sources. To a first approximation, the 
capture confusion noise gets magnified by a factor that accounts for the lost bandwidth.) We conclude that the effect 
of capture confusion noise on the total LISA noise curve is rather modest: Even for the highest capture rates we 
consider, the total LISA noise level [fSf^(f)Y^^ is raised by a factor ^ 2 in the frequency range 1 — 5 mHz. While this 
slightly elevated noise level would somewhat decrease LISA's sensitivity to other classes of sources (e.g., the merger 
of a 10"* Mq BH with a lO^M© BH a.t z — 1), we argue that, overall, this would be a pleasant problem for LISA to 
have, since it would be accompanied by CO-capture detection rates at nearly their maximum possible levels (given 
LISA'S baseline design and the level of confusion noise from galactic binaries). 

For simplicity, we shall lump COs into three classes: lOM© BHs, IAMq NSs, and O.SM© WDs. (While these 
fiducial mass values are unobjectionable for NSs and accurate to within a factor ^ 2 for WDs, it may be a poor 
approximation to assume all BHs weigh IOMq. However, since the distribution function for BH masses is still poorly 
constrained by observation [2], we have opted here for simplicity.) We shall also approximate the average GW spectra 
from these three classes as having identical shapes. In reality, since; the distribution of initial conditions immediately 
after capture (especially, the distribution of initial pericenter) are presumably somewhat different for the three classes, 
the average spectra must also be somewhat different. However, since good models of these distribution functions are 
currently not available (and since our "average spectrum" is anyhow just a crude approximation), we again opt for 
simplicity and neglect any such differences between our three classes of COs. 

We use units in which G = c = 1. Therefore, everything can be measured in the fundamental unit of seconds. 
However, for the sake of familiarity, we also sometimes express quantities in terms of yr, Gpc, or Mq, which are 
related to our fundamental unit by 1 yr = 3.1556 x lO'^s, 1 Gpc = 1.029 x 10^'^s, and IMg = 4.926 x IQ-^s. 

II. SUMMARY OF CAPTURE RATES AND SOURCE PARAMETER DISTRIBUTIONS 

Here we summarize the astronomical inputs necessary for our analysis: current estimates of capture rates for the 
three CO types and estimated distributions of source parameters. We refer to Ref. [1] for a more extended discussion; 
here we mainly just quote the relevant results. 

Captures occur when two objects in the dense stellar cusp surrounding a galactic MBH undergo a close encounter, 
sending one of them into an orbit tight enough that orbital decay through emission of gravitational radiation dominates 
the subsequent evolution. For a typical capture, the initial orbital eccentricity is extremely large (typically 1 — e ~ 
10^'' — 10^"^) and the initial pericenter distance very small (r-p ^ 8 — lOOM, where M is the MBH mass) [3]. The 
subsequent orbital evolution may be divided into three stages (this division is more qualitative than strict): In the 
first and longest stage, the orbit is extremely eccentric, and GWs are emitted in short "pulses" during pericenter 
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passages. These GW pulses slowly remove energy and angular momentum from the system, and the orbit gradually 
shrinks and circularizes. After ^ 10"^ — 10^ years (depending on the two masses and the initial eccentricity — see Eq. 
(1) of Ref. [3]), the evolution enters its second stage, when the orbit is sufficiently circular that the emission can be 
viewed as continuous. Finally, the adiabatic inspiral transitions to a direct plunge, as the object reaches the last 
stable orbit (LSO). In this final, very brief stage, the object quickly plunges through the MBH's horizon, and the 
GW signal cuts off. While individually-resolvable captures will mostly be detectable during the last ^ f — f 00 yrs of 
the second stage (depending on the CO and MBH masses), radiation emitted during the first stage (mostly in short 
bursts near periastron passage) will contribute significantly to the confusion background. 

One of our goals is to estimate the ambient GW energy spectrum arising from all captures in the history of the 
universe. To this end, we shall need the following pieces of information: (i) the space density and mass distribution 
of MBHs; (ii) the distribution of MBH spins; (iii) the capture rate per MBH as a function of the CO mass; and (iv) 
the distribution of plunge eccentricities. In the following we summarize current estimates for these quantities. 

Number density of MBHs: We shall see that, for captures by a MBH of mass Af , the GW energy spectrum is 
peaked near frequencies / ^ 5Mq^ (1 + z)~^mHz, where Mg = M/IO^Mq and z is the cosmological redshift at 
emission. The space density of MBHs, as a function of MBH mass M, has been estimated by Aller and Richstone [4], 
using the measured correlations between M and the bulge velocity dispersion a (M oc tr^) in relatively nearby galaxies, 
as well as the measured correlation between a and galactic luminosity. Here we are principally interested in GWs 
that might effectively raise the floor of the LISA noise curve, in the range ~ 1 — 10 mHz. Therefore we will restrict 
attention to MBH masses in the range 0.1 < Mg < 10. For M < lO"^ Mq, the space density of MBHs, per logarithmic 
mass interval, turns out to be nearly M-independent and is given by [4] 

dN/dlogM = 2 X 10S/iTOGpc"^ (1) 

where /170 = Ho/{70 km s~^ Mpc~^) and 7 is a number of order one. The Aller and Richstone distribution corresponds 
to 7 ~ 2; however, if Sc-Sd galaxies are removed from the sample [as at least some of them (e.g., M33 and NGC 
4395) have MBH masses much lower than would be predicted from the host galaxy's luminosity], this would produce 
a more conservative estimate of 7 ~ 1 [1]. We shall retain 7 as an unknown factor of order unity. 

MBHs' spin: Theoretically, the spins of MBHs may take any value between zero and Af^. The actual values of 
astrophysical MBH spins are difficult to measure directly, and remain the subject of much debate. However, recent 
arguments suggest high spin values: J ~ 0.7 — 0.95M^ [5]. 

Capture rates: In order to survive tidal disruption before or during the final inspiral, the captured object must be 
compact: either a WD, a NS or a BH. Let us denote by TZ^{M; 0) the present-day capture rate (number of captures 
per unit proper time per galaxy) of CO species 'A' by a MBH of mass M, where 'A' stands for either WD, NS, or 
(stellar-mass) BH. Freitag's simulations of the Milky Way [6] predicted present day capture rates of TV^^ = 5 x 10~^ 
yr~^ and TZ^^ = TZ^^ = 10~^ yr~^ in our galaxy (where he assumed all captured WDs, NSs, and BHs have masses 
m = 0.6Mq, 1.4Af0, and 9Mq, respectively). An extrapolation of these results to other MBH masses yields [1] 

7e^(Af; 0) = Kf^M^^^yv-\ (2) 

where the species-dependent coefficients are estimated (from the Freitag [6] rates) by = 4 x 10~^ and 

= kP^ = 6 X 10"''. Other capture-rate estimates, including more recent simulations by Freitag [3], generally 
predict lower rates — for a survey of rate estimates, see [7] . (In particular, we note that Freitag [6] did not take account 
of NS natal kicks, which might deplete the population of NSs in the cusp by a factor ~ 10 [7].) Consequently, one 
must allow for quite large uncertainties in the above rates: More conservative estimates would be 100 times lower for 
and 10 times lower for k^^ and [1]. Hence, in our analysis we will explore the effect of k's in the ranges 

4 X 10"** < < 4 X 10-^ 

6x 10-«<«^s^«^«<6x 10-^. (3) 

Plunge eccentricities: Strong-field radiation reaction rapidly circularizes the CO's trajectory (see, e.g. Figs. 7 and 
8 of [8]). However, captured COs are initially scattered into orbits with such high eccentricity that most retain 
moderate eccentricity all the way to the final plunge. Based on Freitag's simulations [3] we estimated [8] that roughly 
half the captured IOMq BHs plunge with eccentricity larger than 0.2. Freitag's results (see Fig. 1 of [3]) suggest that 
WDs initially scatter, and finally plunge, with somewhat smaller eccentricities, but we found it hard to quantify this 
effect. 
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III. GW ENERGY SPECTRUM FROM CAPTURES 



A. Total GW energy emitted in a single capture 

It is a simple problem in classical general relativity to calculate the total energy radiated by a particle spiralling 
into a Kerr black hole on an arbitrary (inclined, eccentric) orbit, but to our surprise we could not locate the result in 
the literature [9]. Wc supply it here. 

Let a be the total energy radiated to infinity in GWs over an entire given inspiral, expressed as a fraction of the 
CO's mass, m. We want to determine a as a function of the orbit's eccentricity and inclination (with respect to the 
MBH's spin axis) just prior to plunge. Here wc neglect the GW energy emitted by the CO during the final (very 
brief) plunge, since it is smaller than the energy emitted during the adiabatic inspiral by a factor of order m/M. Also, 
we neglect the fraction of energy that goes down the black hole: Hughes [10] has estimated that fraction as less than 
one percent of the total mass in all cases. Under these assumptions, a is approximated as 

a ~ 1 - E/m, (4) 

where E is the energy of the CO at the LSO. (Recall that E includes the CO's rest mass, and takes the value of m 
for a static object at infinity.) 

Geodesic orbits around a Kerr BH are specified (modulo three initial phase angles) by three "constants of motion" : 
the energy E, the "z" component of the angular momentum L^, and the "Carter constant" Q. The "radial" equation 
of motion along each specific geodesic is given by [11] 

P^m^ ' = [E{r^ + a") - aL,] ' - A [m^r^ + {L, - aEf + Q] 



= Rir;E,L,,Q), (5) 



where a is the MBH's spin parameter (i.e., its angular momentum J per M), r is the proper time along the orbit, 
p2 _ ^2 _|_ ^2 pQg2 0^ g^jjjj ^ _ ^2 _ 2Mr + a^. {0 and r are the standard Boyer-Lindquist coordinates.) Due to radiation 
reaction, E, Lz, and Q arc not strictly constant, but slowly evolving in time. The actual inspiral orbit can then be 
thought of as osculating through a continuous sequence of geodesies. For given M and a, the LSO is encountered in 
the first instance that both of the following conditions are met: 

= (6) 

(where the derivative is taken with fixed E, L^, and Q). For a given MBH (given M, a), Eqs. (6) yield a 2-parameter 
family of solutions. For instance, one can specify Qlso and i^LSO, and then solve the algebraic Eqs. (6) simultaneously 
for {Lz)lso and Vp (pericenter radius at the LSO).^ That is, in fact, how we proceed. Next we obtain the apocenter 
radius Va as another root of R[r; E, {Lz)lso, Q] = 0, and then we use these values to determine the eccentricity and 
inclination angle l at the LSO: 

(We adopt here the definition of [10] for the inclination angle.) We thus obtain -Elso for this particular eccentricity 
and inclination. By varying the prescribed i^LSO and Qlso, we can cover the entire physical (e,cost) plane, and 
hence obtain ii^Lso(eLso, cos tLso)- 

We implemented the above algorithm using a simple Mathematica script. Figure 1 shows the results for the case 
where a/M = 0.8. For astrophysically relevant inspirals, with eLSO <; 0.35 [8], the CO emits between ~ 4% and 
^ 12% of its mass in GWs (the former value for retrograde, the latter for prograde equatorial orbits). Figure 2 shows 
the value of a averaged over inclination angles (assuming a uniform distribution in cosi), as a function of plunge 
eccentricity. In Fig. 2 the case a/M = 0.8 is compared to the Schwarzschild case (a/M = 0), where the specific energy 
at plunge is given analytically by 



^Equations (6) generally admit 10 solution pairs, most of which are unphysical. Naturally, one has to select those solutions 
with rp real and greater than the event horizon's radius, rn = M + \JM'^ — o^. 
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m-^Ei^SO = 4[(6 + 2eLSo)(3 - eLso)]"'/^ (for a = 0) 



(8) 



(see, e.g., Eqs. (2.5) and (2.8) of [12]). Fig. 2 shows that the average (over the inchnation angle t) energy emitted by 
inspirals into rapidly rotating BHs is only a little higher than for inspirals into a Schwarzschild BH. 



"Binding Energy" 1-E/m at plunge (a=0.8M) 




plunge eccentricity 

FIG. 1. "Specific binding energy", 1 — E/m, at plunge, as a function of tlie plunge eccentricity and inclination angle, for a Kerr BH 
with spin parameter a = 0.8M. The quantity 1 — E/m approximates a — the amount of energy (per CO's mass m) emitted to infinity in 
GWs during the entire inspiral. For the case a = O.SM shown here, and for astrophysically relevant inspirals with blso < 0.35, the CO 
emits between ~ 4% and ~ 12% of its mass in GWs (the former value for retrograde, the latter for prograde equatorial orbits). 

In the analysis below, we will adopt a — 0.06 as a fiducial average value, based on Fig. 2. 

B. GW energy spectrum from a single capture 

Ideally, we would next compute the spectrum f{dE/df) of GW energy emitted over the entire inspiral, for given 
values of m, M, a, clso, and llso- However, that would be a much harder problem than the one in Sec. Ill A. The 
tools to do a fully satisfactory job do not even exist yet, since there is still no working code to calculate the effect of 
GW radiation reaction on an arbitrary geodesic orbit in Kerr. Nevertheless, a reasonable job at approximating the 
orbital evolution could presumably be done today by inferring the rate of change of the GO's E and from the flux 
of energy and angular momentum at infinity, and approximating the orbital inclination angle t as fixed during the 
inspiral (Hughes [10] discusses the justification of this approximation) — which determines the rate of change of Q. 
Knowing the orbital evolution, one could obtain the GW spectrum by solving the Tcukolsky equation for the emitted 
GWs. 

However, astronomical capture rates are sufficiently uncertain that we feel justified in a much cruder approach to 
this problem: We calculate the spectrum using the approximate, analytic formalism we previously developed in [8]. In 
this formalism, the overall orbit is imagined as osculating through a sequence of Keplerian orbits, with rate of change 
of energy, eccentricity, periastron direction, etc. determined by solving post-Newtonian evolution equations. In this 
approximation, the emitted waveform h^v{t), at any instant, is determined by applying the quadrupole formula to 
an instantaneous Keplerian orbit. The quadrupole-formula waveform for an eccentric, Keplerian orbit was derived 
analytically long ago by Peters and Matthews [13], so these approximate waveforms are relatively easy to calculate. 
These waveforms correctly capture the fact that eccentric orbits have power at all multiples of the orbital frequency 
(even when considering only quadrupole radiation, as we do). We "cut off" the waveform when the orbit reaches 
the point of plunge for a Schwarzschild MBH, at rj, = (6 + 2eiso)/(l + s-lso) — thus, we are essentially restricting 
ourselves to the a = case. (This is partly because our post-Newtonian evolution equations cannot be trusted for 
prograde orbits in near-extremal Kerr. However, given the small difference between the a = and a = 0.8M cases 
in Fig. 2, and given that we will subsequently average the spectra over M, the restriction to a = here should not 
appreciably affect the final average spectrum.) 
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FIG. 2. Specific binding energy at plunge, averaged over (uniformly distributed) inclination angles, as a function of the plunge 
eccentricity. The plot compares the case a = 0.8 of Fig. 1, with the case of a non-spinning MBH (a/M = 0). In the latter case, 
the plunge energy is given by Eq. (8). An "inclination averaged" inspiral in the astrophysically relevant range of plunge eccentricities 
(fiLSO < 0.35), emits around 5—7% of the CO's mass in GWs, with only little dependence on the MBH's spin. 



Figure 3 shows the spectrum of a single capture (for a range of possible plunge eccentricities), as derived from 
the approximate waveform model described above. For each value of clso we integrated post-Newtonian evolution 
equations [Eqs. (28) and (30) in [8]] backward in time to obtain e{t) and i^(i), and, consequently, e(i'). We then 
obtained the power radiated into each of the harmonics of the orbital frequency using the leading-order formula [13] 
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(9) 



where gn(e) is given by 



9n(e) = -^{[Jn-2ine) - 2e J„_i(r7,e) + - J„(ne) -|- 2e J„+i(ne) - J„+2(«e)] 
+ {1- e^)[Jn-2{ne) - 2 J„(ne) -f Jn+2{ne)]^ + -l^[J„(ne)]2}. 



(10) 



We next re-expressed En{i') in terms of the GW frequency / by replacing v fn/n for each of the n-harmonics. 
Finally, we summed up the power from all harmonics (holding / fixed), and plotted the total power vs. /. (In 
practice, we summed over the first 20 harmonics. Note that different harmonics "cut off" at different GW frequency, 
/lso = '^I'LSO, giving rise to the "discontinuities" apparent in Fig. 3.) During the inspiral the source evolves 
significantly in both frequency and eccentricity, with the GW power distribution shifting gradually from high harmonics 
of the orbital frequency to lower harmonics. This leads to a spectrum with a steep rise followed by a "plateau", as 
manifested in the Figure. For comparison, a decaying, quasi-circular orbit would yield a simple power-law spectrum, 
fidE/df) cx f/^. 



C. Model capture spectrum, averaged over glso 



Ideally, the next step in our analyis would be to integrate the individual capture spectra over the actual distribution 
of final eccentricities, in order to obtain an appropriately weighted average. However, since that distribution is poorly 
known, and since our individual spectra are anyway approximated, we instead introduce the following model for the 
"average" spectrum (for a given MBH mass), suggested by Fig. 3: 



fidE/df) 



(.///o)^ 

const = ifp/fof, 
0, 



/ < fp, 
fp<f< 4.fp, 

f > ^fp- 



(11) 
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f(Hz) 

FIG. 3. Energy spectrum of a single inspiral, for M = 10® Mq, a = 0, and for a range of plunge eccentricities blso- '^lso is the orbital 
frequency at the LSO. These spectra are based on approximate, post-Newtonian waveforms as described in the text. Each of the curves 
represents the sum of contributions from the first 20 harmonics of the orbital frequency. Discontinuities appear whenever an n-harmonic 
reaches frequency n X i^lso cuts off. 



Here, /o is a normalization parameter we will determine soon, and fp is the plunge frequency at Clso = 0: 

fp = (27r6^/2^f) ^ = 2.20 mHz x Mg-\ (12) 



where, recall, Mg = M/{10^Mq). 

The parameter fo is determined from the total energy radiated in GWs over the entire inspiral. Let this total 
energy be am, where, recall, a is roughly our fiducial value of 0.06. We have 

a^m-' f{dE/df)d\nf ^ {f/fofdf/f+ [fp/ fofdf / f ^ (fp/ fof[l/3 + HA)], (13) 
Jo Jo J fp 

and solving for /o yields 

/o 6.74mHz x {a/QMy^'^ M^^ . (14) 
Thus the energy spectrum can be rewritten as 

r 3.27 X 10'3^f|/3 jj^, / < /p, 

f{dE/df) = {m/MQ)MQ X (a/0.06) x I 3.49 x 10-^, /p < / < 4/p, (15) 

[ 0, / > 4/p , 

where /mHz = //(I mHz). This spectrum is illustrated in Fig. 4. Our model spectrum is not particularly accurate, 
but it is good enough for our purposes: We will further average it over the MBH mass (in section HI.B.), and the 
result of this averaging will anyway "smear out" the exact details of the shape shown in Fig. 4. For our application, 
what matters most is that our model spectrum is peaked at roughly the right frequency, / ~ SHz/Mg, that it is 
relatively narrow (most of the energy is contained within a factor '--^ 5 — 10 range in frequency), and that it "contains" 
the right total amount of energy. 



D. Model capture spectrum, averaged over MBH mass 

The above model spectrum corresponds to a given MBH mass M . We next average this spectrum over M, with 
the average being weighted by the capture rate for MBHs of mass M . 
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f (Hz) 

FIG. 4. Our model spectrum [Eq. (15)] for a single inspiral with M = IO^Mq, averaged over inclination angle and final eccentricity, 
and normalized to a = 0.06. The n*'' GW harmonic cuts ofl^ when f/n reaches the plunge frequency, fp, causing the spectrum to flatten 
out for fp < f < '^fp- We cut off our model spectrum at / > 4/p , where we no longer find emission from any of the dominant harmonics 
(n ~ 2 - 4). 



From Eq. (2), the (current) rate of captures by MBHs of mass M is oc The space number density of MBHs 

with masses 0.1 <J Mg < 10 is approximately constant in this range [see Eq. (1)]. Therefore, the average GW energy 
spectrum from a single capture (per unit mass of the CO) is approximately 
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(/) = 0.1923 (a/0.06) / {dMe/M(i)M^^^ x <^ 3.49 x lO^^, 



3.27 X 10-3M|/3h„ / < fp, 



0.1 



0, 



fp<f< ^fp, 
f > 4fp, 



(16) 



where fp = fp{Me) is given in Eq. (12), and the prefactor 0.1923 is just [J^l{dMe/Me)MQ^^]~^. 

To evaluate the integral in Eq. (16) we consider separately 5 different frequency ranges: 
For f < 0.22 mHz we have f < fp for any M in the integration domain. This low-frequency range is therefore 
controlled by the "tail" part of the spectrum, and we find 



/•lO 

e(/ < 0.22 mHz) = (a/0.06) 6.297 x IQ-'^fl^^ / dMgMg 

J 0.1 

= (a/0.06) 4.425 X 10- VmHz- (17) 
For 0.22 < f < 0.88 mHz we have / < 4/p for any M, and now there are both "tail" and "plateau" contributions: 

r2.2//„,Hz 



^19/8 



e(0.22 < / < 0.88 mHz) = (a/0.06) 



Ml^'^dMQ + 6.710 X 10-3 



6.297 X 10-V^Hz 

= (a/0.06) 4.243 x 10-^(1 - 0.5039 /"nf). 
For 0.88 < f < 22 mHz we need to cut off the integration at Afg = 8.8 mHz//: 

r2.2//,„H 



M-^l^dMe 



2.2//„Hz 



(18) 



e(0.88 < / < 22 mHz) = (a/0.06) 



6.297 X lO-VmHz 

= (a/0.06) (1.907 x IQ-^ fJl^ - 7.881 x 10'^ fl^. 
For 22 < f < 88 mHz there remains only a plateau contribution: 

/■8.8//„H 

e(22 < / < 88 mHz) = (a/0.06)6.710 x 10" 



Ml^'^dMa + 6.710 x IQ-^ 



8.8//„Hz 



2.2//„,Hz 



(19) 



M-^'^dMf, 



(a/0.06) 4.046 x IQ^'^ifJl^ - 0.1866). 



(20) 



Lastly, we have 
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e(/ > 88mHz) = 0. 



(21) 



To summarize, the energy spectrum from captures (per unit CO mass), averaged over all captures by MBHs with 
masses in the range 10^ to lO"^ Mq, is 



e(/) - (a/0.06) x <^ 



( 4.425 X 10-' fU: 
4.243 X 10-2(1 - 0.5039 /„^|f) 

1.907 X 10-2/,;|f - 7.881 X IO^^/^hz: 0.88 < Uh. > 22, 



/niHz < 0.22, 
0.22 < < 0.. 



(22) 



4.046 X 10-2 (/J// - 0.1866), 
I 0, 



22 < /niHz < 
/niHz > 88 . 



This spectrum is depicted in Fig. 5. 



a=0.06 





f(Hz) 

FIG. 5. Model energy spectrum for a single capture, averaged over MBH mass and normalized by a = 0.06 — see Eq. (22). Note the 
apparent sharp drops in the spectrum at frequencies below ~ 10~^ Hz and above ~ 2 X 10~^ Hz are simply an artifact of our restricting 
attention to MBH masses in the range 10^ — IO^Mq. 



IV. SPECTRAL DENSITY OF CAPTURE BACKGROUND 



A. Total energy spectrum of capture background 



Using the above estimates, we want to determine the ambient GW spectrum from all captures in the history of the 
universe. To simplify the analysis, we will first assume that spacetime is flat, that the capture rate has been fixed at 
TZ'^{0) for the last T = (3 ■ 10^° years, and that previous to that the capture rate had been zero. In Subsection IV D 
below we carry out a more careful analysis, including cosmological and evolutionary effects, and show that the above 
simplified version actually gives reasonably accurate results, especially if one makes the judicious choice (3 = 0.7. 

Using our flat spacetime assumptions, the GW energy density spectrum from captures of species A, cipQ^/d(ln/), 
is just the average energy spectrum per capture, m'*e(/), times the current total capture rate per unit co-moving 
volume, 7^'^(0), times T. From Eqs. (1) and (2), the factor 7^^(0) is 

^.10 

7^^(0) = / n'^{M;0){dN/dMe)dMe 
Jo.i 

= 1.040 (k^/IQ-'^) 7 h% Gpc"Vr^^ • (23) 
Using dp^y^/dilnf) = m^7^^(0)^e(/), with the conversion factor lAfo/Gpc^ = 4.521 x 10"^^ s-^ ^ we then find 
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3.291 X 10-- (^)(A][^]lh^roX e(/) sec^, (24) 



d{\nf) ' \MqJ \0.7 J \10 

where, recall, e(/) is given in Eq. (22) above. 



B. LISA noise model 



Our estimate of dp-Q^^ / d{lnf) leads directly to an estimate of the spectral density of the capture background 
measured by the LISA detector. But before proceeding to this, we briefly state our conventions and review standard 
estimates for the magnitudes of other noise sources. For more details we refer the reader to Sec. V.A of Ref. [8]. 

LISA'S GW data stream is basically equivaknit to the output of two synthesized Michelson interferometers, where 
in each Michelson the angle between the two arms is 60°, and where the two Michelsons — hereafter labelled I and II — 
are rotated with respect to each other by 45°. This representation is only accurate for low frequencies, / ^ 0.01 Hz. 
However, most of the SNR from captures will indeed be accumulated at these lower frequencies, so this approximation is 
adequate for our purpose. At low /, our synthesized detectors I and II are basically equivalent to the data combinations 
lablelled A and E in the terminology of Time Delay Interferometry [14]. The output si{t) of synthetic-Michelson I is 
the sum of resolvable GW signals hi{t) and noise ni{t); similarly, S//(i) = nuit) + huit). 

In keeping with the usual convention in the LISA literature, we define Sh{f) to be the single-sided, sky- averaged 
noise spectral density for either synthetic detector, I or II: 

{ni{f) MIT) = (niiif) finif'T) = ^S{f - HShU), (25) 

where a tilde denotes the Fourier transform and "( )" stands for the expectation value. The factor ^ here is the 
product of the following three factors: \ from the "single-sided" convention, | = sin^60° because the angle between 
the arms is 60°, and \ from an average over source directions and polarizations. [Note here we are using a different 
convention than in our earlier paper [8], where Sh{f) was defined without the sky-averaging factor.^] 

LISA'S noise has three main components (besides confusion noise from captures): instrumental noise, confusion 
noise from short-period galactic binaries (mostly WD binaries), and confusion noise from short-period extragalactic 
binaries. For LISA's instrumental noise, 5'^"^'(/), we use the curve available from S. Larson's online sensitivity curve 
generator [15], which is based on the noise budgets specified in the LISA Pre-Phase A Report [16]. For f ^ 5 mHz, 
the instrumental noise curve is well fitted analytically by [17] 

S'^^'^f) = 6.12 X 10-^V~* + 1-06 X lO-^'o + 6.12 x 10"^^^ Rz'^, (26) 

where / is in Hz. 

Next we turn to confusion noise from galactic and extragalactic WD binaries (GWDBs and EGWDBs, respectively). 
Any isotropic background of individually unresolvable GW sources represents (for the purpose of analyzing other 
sources) a noise source with spectral density [18] ^ 

5^^^/)=^/-^!^. (27) 

Here and throughout this paper, our convention is to use the "calligraphic" Sh to represent the spectral density of 
the entire background from some class of sources, including both resolvable and unresolvable sources. The fraction of 
Sh due to the unresolvable sources, which we refer to as "confusion noise" , we denote by Sh with an Italic typeface. 



^Note the PRD version of [8] also contains errors, as follows: The RHS of Eq. (48) should be multiplied by |; the factor 5 
in Eq. (55) and in the sentence below it should both be replaced by and the same for the factor 5 in the three sentences 
preceding Eq. (60). 

^Note the RHS of our Eq. (27) is a factor 5 = (20/3)sin^(7r/3) larger than the RHS in Eq. (3.6) in [18]. The factor 
20/3 = 5/sin2(7r/3) is our usual conversion factor between LIGO and LISA conventions, and the sin (7r/3) = I factor arises 
because a smaller opening angle reduces LISA's sensitivity to background GWs. Thus the two factors of sin (tt/S) cancel. 
The convention followed in this paper is such that the ratio of <S^'"''^^^(/) to dpow/dln/ is independent of the opening angle 
between the arms. 
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We shall also use the "upright" typeface for the instrumental noise, S]^^*^. Estimates of dpcw / d\n f from the galactic 
and extragalactic WD backgrounds [19,20] then yield the following background spectral densities: 

= 1.4 X 10-44 (^^) '^'bz-\ (28) 

^EGWDB(_^^ ^ 2.8 X 10-46 (^^^ (29) 

(In fact, there is practically no distinction between ^egwdb ^^^^ ^egwdb^ since the entire extragalactic contribution 
is assumed to consist of unresolvablc sources.) 

The GWDB background is actually larger than LISA's instrumental noise in the range ^ 10"4-1G^^ Hz. However, 
at frequencies / ^ 2 x IQ-'^Hz, galactic sources arc sufficiently sparse, in frequency space, that one expects to be 
able to "fit them out" of the data. Still, even resolvable sources introduce an additional kind of "noise" since they 
can never be subtracted out perfectly. To the extent that such "subtraction errors" can together mimic other classes 
of astrophysically interesting signals, they effectively diminish the sensitivity of LISA to those other signals. (This 
effect is also referred to simply as "confusion noise" within the LISA community, though conceptually it can be useful 
to consider the two kinds of confusion noise separately.) In the case of GWDBs at frequencies above ^ 2mHz (i.e., 
at frequencies where they become individually resolvable), this second type of confusion noise acts effectively like a 
multiplicative factor on all other types of noise. E.g., if one is searching for coalescing MBH binaries, but the signal 
from some fraction F{f) of the frequency bins near frequency / are rendered effectively unusable for this purpose 
because of degeneracies with the fitted GWDBs, then it is as if the noise spectral density were increased by a factor 
(1-F)-M21]. 

We denote by Sf^^ the total effective noise, including the multiplicative effect from this second type of confusion 
noise, and adopt the following rough estimate of , based on Hughes [22] 

Sf{f) = min{[ST\f)+Sr™{f)] eM^T-l..iondN/df), Sr\f)+Sr'^''''{f)+S^'^^^{f)} . (30) 

Here dN/df is the number density of GWDBs per unit GW frequency, Tmission is the LISA mission lifetime (so 
A/ = 1/Tmission IS the bin size of the discretely Fourier transformed data), and k is the average number of frequency 
bins that are "lost" (for the purpose of analyzing other sources) when each galactic binary is fitted out. For dN/df 
we adopt the estimate [22] 

and take nT'^^^^^^^ = 1.5/yr (corresponding, e.g., to Tmission « Syr and n « 4.5). 

Even though the subtracted noise model, Eq. (30), basically represents an "educated guess," it has the right 
qualitative features: The very steep slope of dN/df means that a factor 3 increase in / is sufficient to reduce the 
source density from (say) 7/bin to 1/(8 bins). The "unsubtractable part" of the galactic WD curve must therefore 
fall very steeply near / ~ 1 — 2 mHz — a feature accounted for by the exponential function in Eq. (30). Also, a factor 
(say) 2 change in our fiducial value for kT~^^^^^^ would only shift the "drop-off" frequency by a factor 2^^^^ w 1.2. 
Therefore we believe Eq. (30) must be a reasonably accurate representation of the total effective noise. 

There are likely tens of thousands of CO capture sources (out to 2; = 1) in the LISA band at any instant, and we shall 
see that a signficant fraction of them arc not resolvable. How should we modify Eq. (30) to take into account confusion 
noise from CO captures? We shall see that even if all CO captures were unsubtractable, so that the entire capture 
background is "counted" as confusion noise, the CO confusion noise would still be smaller than the (pre-subtraction) 
noise from GWDBs, though it could be larger than the instrumental noise (depending on the exact capture rates). In 
this case, it seems sensible, as a first approximation, to simply add the CO confusion noise contribution, S'/^^''^^{f), to 
S™^^{f) in Eq. (30). That is, we estimate the total effective noise density as 

Sfif) = min { [ST\f) + ^r'^^^l/) + cM^T-'dN/df), 

Sr\f) + 5^'^°^(/) + S^^'^if) + • (32) 



^Actually, Hughes [22] writes Sf{f) = min {ST'if) cMi^T-^si.ndN/df), SL-'(/) + '5,?^°^(/)} + 5,?G^°S(/), but it 
makes no sense to treat jJEGwdb (ijiffgrently from S}^^'^ in this context, so we have added them together in Eq. (30). However, 
^EGWDB^j^ is sufficiently small that this modification has almost no effect in practice. 
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Note that Eq. (32) docs not include the effect of "subtraction errors" made in removing the resolvable CO captures. 
We can reasonably neglect this noise contribution, for the following reason. Over a ~ 3— yr mission, the number 
of individually detected captures will probably be 3 x 10^ [1]. Each capture source is completely specified by 17 
parameters [8]; thus ^ 5 x 10"* real numbers arc required to specify their combined signal. In comparison, the signal 
(from both synthetic detectors) between 1 and 10 mHz is represented by ~ 4 x 10^ real numbers (i.e., roughly 10^ 
discrete frequency bins, times two real numbers per bin, times two independent detectors), so in principle only 1% 
of the available bandwidth is "lost" by fitting out these captures. 



C. Noise spectral density 



We now obtain S^^^p^ (J) — the spectrum of the background from all capture sources of type A — by simply combining 
Eqs. (22), (24), and (27). This yields 



7/^70 



X < 



18.540 

l-778/-L(l- 0.5039/-^//), 
3.302 X 10-6, 



0-799/jy' 



3/8 



l-695/-i^,(/^Hz 
0, 



0.1866), 



10- 

/mHz < 0.22, 

0.22 < /„Hz < 0.88, 
0.88 < /mHz < 22, 

22 < /mHz < 88, 
/mHz > 88. 



(33) 



Recall that here a is the total amount of energy (per CO's mass m) radiated over the course of a single inspiral (cf. 
Fig. 2), /3 is the time (in units of 10^° yr) during which captures occur, k"^ is an event rate parameter [see Eq. (3)], 
and 7 is a parameter of order unity that normalizes the space-density of MBHs. This capture background is plotted 
in Figs. 6, 7, and 8 (for captures of WDs, NSs, and BHs, respectively), superposed on LISA's instrumental noise and 
WD-binaries confusion noise. Each of these figures plots the background for a range of assumed capture rates k^; the 
other parameters in Eq. (33) are all assumed to have their fiducial values: a = 0.06, /? = 0.7, and 7 = 1.0. The WD, 
NS, and BH masses are 0.6, 1.4, and lOM©, respectively. 



D. Influence of cosmology and source evolution 

Our calcidation of <S^'^^*(/) in the previous subsection ignored source evolution and treated the spacetime as flat — 
counting all sources within /? x 10^° It-yr of us and ignoring the rest. Here we repeat the calculation but for a 
realistic cosmology and a few different evolutionary scenarios. We shall see that if we set (3 to 0.7, then in the crucial 
1 — 5 Hz range (where capture noise could possibly have a significant impact on LISA's total effect noise level), this 
more-accurate treatment results in differences from our flat model at a level ^ 15%. This uncertainty associated with 
source evolution is still much smaller than the uncertainty in the current event rate. 

Let £(/, z)A/ be the rate (per imit proper time, per imit co-moving volume) at which GW energy between fre- 
quencies / and f + Af is emitted into the universe, at red-shift z. (Here / is the frequency measured by a contempo- 
raneous observer, not the red-shifted frequency we measure today.) For instance, for source type A, the rate today is 
S{,f, 0) = TZ^ {0)m^ f e{f) . The dependence of f (/, z) on z arises principally from two effects. Firstly, the capture rate 
in the past, TZ^{M;z) (measured per unit proper time, per MBH) can differ from today's rate TZ^{M;0). Secondly, 
the MBHs were smaller in the past (since they are growing by accretion), which implies that the spectrum £{f,z = 2) 
(say) is blue-shifted compared to the spectrum £{f,z = 0). (We stress that this blue-shifting is a source-evolution 
effect. There is also a cosmological red-shift, to be accounted for momentarily.) The total number of MBHs (per unit 
comoving volume) may also evolve due to mergers, but we expect this effect to be small between z ^2 and now, and 
so for simplicity we take this total number to be constant. 

To calculate the GW energy density today, in some frequency band of width A/, one integrates the contributions 
from all times, after appropriate red-shifting of the waves' frequency and energy: 

where A/' = A/(l + z) is the blue-shifted (relative to today) frequency range of the GWs when initially emitted, and 
the factor 1 -|- 2; in the denominator accounts for the decrease in the waves' energy due to the redshift between z and 
now. Obviously the two (1 -|- z) factors cancel, so Eq. (34) can be simplied to [23,24] 
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Instrumental 

- - Instrumental+WD binaries 
Background of WD Captures 




10" 



10" 



f(Hz) 



10" 



FIG. 6. Comparison of the background gWDcapt jj.^^^ WD captures (dash-dot hnes) with LISA'S instrumental noise (sohd hne) and 
with LISA'S instrumental plus confusion noise from WD binaries (dashed line). We show three cases, corresponding to WD capture rates 
of = 4 X 10"'', 4 X 10"^, and 4 X lO"*— cf. Eqs. (2) and (3). 



|=£j[/(l + .),4d.. (35) 

We shall assume a spatially flat {fl = 1.0) Friedman-Lemaitre-Robertson- Walker universe, with JIa = 0.70 and 
= 0.30. For our fiducial cosmology, the universe's current age tq is given by 

TO 0.964i7(7^ = 1.39 x 10^"/!^ Y^- (36) 

Here we shall consider only sources in the range < z < 2. (We will see below that sources at 2; > 2 are unlikely to 
add considerably to pcw-) Iii this range, the following power law relation between the universe's age t and redshift 
z is accurate to within ^3%: 

T-To(l + 2)-i-i«. (37) 

For our estimates in this section, we will approximate this as r = tq(1 + z)^^'^. 

To get some idea for the range of possible answers, we shall consider two possible scalings for the capture rates 
TZ^{z) and two possible scalings for the MBH mass M{z) (i.e., four possible scenarios in all): For the capture rates, 
simulations of Milky Way captures by Freitag [6] , as well as an analytic argument by Phinney [25] , both suggest that 
7i^{z) should increase in the past like r~^/^ oc (1 + z)°-^. For comparison, we will also consider an 7i^{z) = const 
scenario. Similarly, for the MBH masses, we will consider as possibilities (i) M = const and (ii) M (x r^/^ (as 
suggested by Sirota et al. [26]). The four different assumptions yield the following four different relations between 
£{f,z) and£(/,0): 



£{f, 0), 7^^(z) = const, M = const, 

^ I (l + z)0-6f(/,0), 7^^(z)cx (l + z)0-6, M = const, 

^•'' (l + z)-0-6f[/(l + z)-0-6,0], 7^^(z) =const, Moc (l + z)-0-6, ^^""^ 

£[f{l + z)-0-6, 0] n^{z) cx (1 + z)0-6. A/ cx (1 + z)-"-^. 



To derive the third line on the RHS, note that under a rescaling of MBH mass, M — > M' — XM, with total luminosity 
fixed, the spectrum gets re-scaled according to £'{f')Af' = £{f)Af, where /' = A^^/, or 
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£'(/')= A £(A/'). (39) 

In our case, M' is the MBH mass at redshift z, so A = (1 + z)^°-^, and the third hne foUows when we replace the 
dummy variable /' by /. Lines 2 and 4 on the RHS of Eq. (38) are obtained by simply multiplying lines 1 and 3 
(respectively) by (1 + z)*^'^ (the assumed rate enhancement factor for early times). 

Again, we are interested in the CO capture spectrum chiefly in the range 1 < / < 7 mHz. If we are to integrate 

Eq. (35) out to z = 2 then we must know £{f,0) for / between 1 mHz and 7 x (1 + 2) = 21 mHz. In this range, 
f (/, 0) is basically a power law: £{f,0) cx f~^e{f) oc f~^^^^ (see Fig. 5). 

For the remainder of this section, we will approximate £(/, 0) by the above simple power law; i.e., we assume 
£{f,0) — for some W. Using dr/dz ~ — 1.2ro(l + z)^^'^ and integrating Eq. (35) out to z = 2, we then 

obtain 

^ = 1.2 To t y-^-^^^+'^+'^dy, (40) 

df Ji 

where y = (1 + z). In the exponent, ^ = [for R^{z) = i?'^(0)] or 0.6 [for R^{z) = (1 + z)°-^R^{Q)], and ti = [for 
M{z) = M(0)] or (-0.6)(-ll/8+ 1) = 0.225 [for M{z) = M(0)(1 + z)-°-^]. Note that the shape of the spectrum is 
always the same; only the amplitude varies with evolutionary scenario. We can write the result in the form 

^ = Wr,sr''/^ (41) 
df 

for some "effective" integration time Toff. Performing the elementary integral in Eq. (40) (taking h^o = 1), we find for 
our 4 cases: t^s = 6.1 x lOV (m = ^ = 0), Tcff = 7.5 x lO^yr = 0.6, a = 0), Toff = 6.6 x lOV (/x = 0, ct = 0.225), 
and Toff = 8.1 X lOV (a^ = 0.6, a = 0.225). 

By comparison. Toff for the flat-universe/no-evolution model considered in the previous subsection would just be 
the total integration time: (3 x 10^°yr. Thus, if we make the choice /3 = 0.7, our flat-universe/no-evolution calculation 
agrees with all four of our cosmological/evolutionary models to within ^ 15%. We note that 7 x 10^ years ago 
corresponds to z = 0.79. We also note that in all four models, most of the contribution to the integral comes from 
the range < z < 1; the contribution from 1 < z < 2 accounts for only 12 — 18% of the total (depending on the case). 
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- - Instrumental+WD binaries 




10"' 10"' 10"' 

f^Hz)^ 

FIG. 8. Same as in Fig. 6, but this time showing the background 5,^ from all BH captures. We show two cases, corresponding to 

BH capture rates of = 6 x 10"'' and 6 X 10~**. 

This justifies our cutting ofi^ the integral at z = 2, which is convenient, since our simple evolutionary models would 
not be trustworthy at higher z. 

V. CONFUSION NOISE FROM CAPTURES 

In Sec. IV we estimated the spectrum of the background due to all captures. At / = 3 mHz (i.e., 

near the bottom of LISA noise curve) S'f^^^^{f) equals the instrumental noise level 5'™''*(/) for k,^^ — 1.6 x 10~^, 
rNS _ g_g X 10^^, or kP^ — 9.5 x 10^^. At the high end of their estimated ranges, the actual rates are larger 
than these values by factors of ~ 2.5, 1, and 6, respectively. Thus, confusion noise from capture sources could have 
a significant effect on the total LISA noise level, making it is important to next consider what fraction of S^^^^{f) 
actually constitutes an unresolvable confusion background (or, equivalently, what portion is resolvable and hence 
subtractable). With this goal in mind, we first make some general remarks on how the confusion noise level depends 
on both the event rate and the available subtraction method. 



A. General scalings of source rate, detection rate, and confusion noise 

In this subsection, we step back and discuss the general phenomenon of confusion noise in GW searches. Consider 
some class C of astronomical sources, having some event rate TZ, measured in Mpc^^yr^^. (For sources that are 
always "on", e.g. GWDBs, we would define TZ as just the spatial density, measured in Mpc""^.) We first ask: How 
does the detection rate V vary with 7?,? Here, for simplicity, we will imagine that source types other than C contribute 
negligibly to confusion noise; i.e., all noise is either instrumental noise or confusion noise from C-type sources. Then 
for sufficiently low TZ, confusion noise will be negligible, so the average distance ddot out to which one can detect 
individual C-type sources is a constant (i.e., independent ofTZ) set by the instrumental noise level. Clearly, then, the 
detection rate T> clearly grows linearly with TZ at small TZ. But as one increases TZ, eventually one must reach a point 
where confusion noise from C-type sources dominates over the instrumental noise, and so determines ddet- Clearly, for 
sufficiently large TZ, almost all events add to the confusion noise, so S^{f) oc TZ, and therefore d^ct oc TZ~^^^. If d^ct 
is in the regime where space can be approximated as Euclidean and the source distribution can be approximated as 



15 



spherically symmetric (generally true when ddct is much larger than typical separations between galaxies, but much 
smaller than a Hubble length: roughly lOMpc < ddct < IGpc), then the detection rate V cx TZd^^^ oc TZ~^^'^, for large 
TZ. Thus the individual-source detection rate V must peak at a certain rate TZc, which is roughly the event rate at 
which confusion noise starts to dominate over instrumental noise in setting ddot- This is illustrated in Fig. 9. 



Event rate density, R 

FIG. 9. Illustration ol how the detection rate "D for some source class C scales with the event rate H (number of sources per unit time 
per unit space volume). For sufficiently small TZ, we have "D oc TZ, whereas for very large TZ we have X> oc TZ~^^^. Hence, the detection 
rate must peak at a certain event rate, TZc- TZc is roughly the rate where confusion noise from C-type sources begins to dominate the total 
noise level. 

Next, we ask how the confusion noise S^{f) from unresolvable sources scales with the rate TZ.^ For low values of 
TZ, Sf^{f) should be a fixed fraction of S^{f) (arising from those sources whose SNR is too small to permit individual 
detection), and so grows linearly with TZ. On the other hand, for very high TZ, where confusion noise limits the 
detection rate, the fraction of source energy that is "unresolvable" grows and approaches one at sufhciently high TZ; 
i.e., S^{f) approaches S^{f ). Thus, S^{f) grows approximately hnearly with TZ at very high TZ, too, but with much 
larger slope than at low TZ. Between these two regimes, near TZc, S^if) grows nonlinearly with TZ — see Fig. 10. 





Event rate density, R R 

c 

FIG. 10. Illustration of how the confusion noise SjJ scales with the event rate density TZ. is linear in TZ in both the very small TZ 
and very large TZ regimes. It depends on TZ non-linearly around the rate TZc, where confusion noise begins to dominate the total noise 
level. 

To recapitulate: First, we have observed that the event rate TZ where 5^ begins to dominate the total noise is 
roughly the event rate that gives the highest individual-source detection rate for C-type sources — so one is lucky to 
be in that position! Second, the confusion noise level is generally hardest to estimate precisely when it begins to 



^Of course, S^{f) is a function, not a single number, so here we mean its value at some typical frequency, where it strongly 
affects, and is affected by, the detection rate "D. 
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dominate, since there it depends nonlincarly on TZ. It is easy to see the problems that arise when S'j^{f) vs. TZ is in 
the nonhnear regime: When confusion noise is the dominant noise source, one cannot determine which sources are 
detectable without first knowing the confusion noise level; yet, to calculate the confusion noise level, one must know 
which sources are detectable and hence subtractable. 

We imagine that in practice one gets around this "chicken-and-egg problem" by an iterative procedure of the 
following sort. At the very beginning, before any sources have been identified, the entire background must be treated 
as confusion noise. Then one identifies the very brightest sources (i.e., those with highest SNR) and subtracts them. 
With the noise level so reduced, it will be possible to identify somewhat weaker sources. One subtracts them as 
well, and iterates until there is no clear improvement. The confusion noise is roughly "what's left over" from the 
background after the detectable sources have all been subtracted. (Note that in practice, for LISA, this strategy is 
complicated by the fact that there are at least two distinct and important sources of confusion noise: CO captures and 
compact binaries. Moreover, the galactic binaries are always "on", and so as time goes by, more and more galactic 
binaries will be individually identified, and then subtracted from the entire LISA data stream. Thus a CO capture 
that plunges in 2014, and is slightly too faint to be detected then, might become "visible" in the old data starting in 
2017, thanks to improved cleaning out of galactic binaries.) 

Finally, we mention that there are cases where S'^ (/) is particularly simple to estimate: If only a tiny percentage of 
C-typc sources can be individually identified even at very small TZ (i.e., even at rates where S'^{f) does not dominate 
the total noise), then S^{f) is approximately S^{f) for all rates TZ. We shall see that WD and NS captures are 
basically in this "easy to estimate" category, but captures of IOMq BHs are not. For the latter, we shall content 
ourselves with some less-accurate estimates (rather than trying now to simulate the iterative procedure outlined 
above) . 



B. Dependence on subtraction method 

We want to estimate what fraction of g^'^'^P^ is due to detectable sources. Unfortunately, computationally practical 
methods for detecting captures arc likely to be substantially less sensitive than the "optimal" (but here completely 
impractical) method of coherent matched filtering. The parameter space of CO capture waveforms is 17-dimcnsional; 
a full matched-filter search for a single source over this space (using a discrete set of template that densely cover 
the space like a mesh), has been estimated (very roughly) to require ~ 10^° templates [1]. Therefore, the threshold 
signal-to-noise ratio SNRthrcsh required for a 1% false alarm probability in a search over the entire template bank is 
given by erf(SNRthresh/-\/2) = 10"''^, or SNRthresh ~ 14. (Note that at such high thresholds, the exact value of the 
threshold is quite insensitive to our estimate of the number of required templates; increasing the estimate to 10^° 
templates only raises the threshold to SNRthrcsh ~ 15, assuming Gaussian noise.) 

Here SNR means the optimal, matched filtering SNR using the complete LISA data set — approximated in this 
paper as the output of two orthogonal Michelsons. There are some well known tricks for speeding up the search 
(e.g., of the ^ 10^" templates mentioned above, a factor ^ 10^ comes from simple time translations of otherwise 
identical templates, and this subspace of time-translations can be searched very efficiently using FFTs [27]). Still, it 
is completely impractical to imagine searching directly over this vast set of templates. In Gair et al. [1], a strategy 
was outlined of searching for CO captures in a hierarchical fashion; the signal strength required for detection with this 
strategy was estimated to be SNRthresh ~ 30, for a 3-yr integration with realistic computing power (where "realistic 
computing power" at the start of the LISA mission in ^ 2013 was estimated using Moore's Law). 

For the purpose of estimating confusion noise levels, in this paper we shall assume that individual sources are 
detectable in practice if their 3-yr, matched-filtering SNR (in detectors I and II combined) is ^ SNRthresh, where 
SNRthrcsh = 30. We shall also see that our basic conclusions would change very little if the SNRthresh were half or 
double this value. 



C. Estimating the fraction of subtractable capture sources 

We now estimate what fraction of the capture background S^'^^'^^{f) is subtractable, and what is the leftover portion 
that constitutes the capture confusion noise S^'^'^^^{f). We estimate this as follows. Let Pq^^^ be the local (i.e., near 
LISA) GW energy density from captures of type A (again, A = WD, NS, or BH). We ask what fraction of Pq'^^^ 
is due to sources that are undetectable by LISA. We then estimate that Sf^'^'^^f) = U^S-^''''^'\f). Of course, this 
estimate is quite crude in particular, it ignores any frequency-dependence in the ratio Sf^'^"'^^''(f)/Sf^'^'^^^^{f) — but it 
seems good enough for our purposes. [A less crude version would be to subdivide A into many smaller subclasses, 
parametrized by (at least) the MBH mass and the time r left before final plunge. One would then estimate the 
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spectrum from undetectable sources in each subclass, and then average all the spectra (weighted appropriately) to 
get S^'^'^^^{f). This would give a nontrivial frequency dependence to S^'^'^^*'{f)/S^'^'^^^{f).] 

For our estimates, we will need to know how the GW luminosity from capture sources varies over their inspiral 
history and also how their detcctability (i.e., their 3-yr SNR) increases over this time. For a given capture, let t be 
the time left until the CO plunges into the MBH. We estimate the luminosity as a function of time, E{t), using the 
Peters and Matthews result [13] (derived using the quadrupole formula) for the quasi-Newtonian elliptical orbits: 

^ = ^(m/M)2(27rM!/)io/3(l - e^'/^ [l + {73/24)e^ + (37/96)6^] . (42) 

etc D 

Here v and e are the orbital frequency and eccentricity, respectively, whose evolution is described to lowest order 
by [13] 

^ = -^(m/M3)(27rM!y)iV3(i _ e^y7/2 n ^ (73/34)6^ + (37/96)e^] , (43) 
at IOtt 

^ = --^(TO/M2)(27rM!/)8/3(l - e2)-5/2(304 + 1216^). (44) 
at 15 

We integrate Eqs. (43) and (44) backwards in time, from t-Lso = (27rM)~^[(l — e^)/(6 + 2e)]^/^, to obtain i'{t) and 
e{t) [c.f. the discussion around Eq. (59) in [8]]. We then integrate E{t) numerically to determine the amount of 
energy E{t) emitted before time t = — r, as well as the fraction E{t)/E{0), where £^(0) is the total GW energy 
emitted from the beginning of the inspiral until plunge. The outcome from this calculation is presented in Fig. 11 for 
"typical" captures of a WD, a NS, and a BH (with m = 0.6, 1.4, and lOM©, respectively); more specifically, we took 
clso = 0.15 and a = (i.e., the MBH is Schwarzschild), and considered a range of MBH masses. Note that a large 
fraction of the GW energy is emitted long before the source is detectable by LISA. In a typical BH capture, roughly 
40% the GW energy will have been emitted already 10 years before the BH plunges. If the captured object is a WD, 
40% of the energy will have been emitted 150 years before plunge. 

For such typical sources, the GWs emitted near plunge are in the LISA band, so one might naively have thought 
that the GWs emitted 10 — 150 yr before plunge would be at frequencies well below the LISA band. But this is not 
true, since most of the radiation is emitted in relatively high-frequency bursts (i.e., high harmonics of the orbital 
frequency) near periastron passage. To illustrate this, in Fig. 12 we plot the signal from a WD captured by a IO^Mq 
MBH at 1 Gpc. This plot shows the evolution of the signal during the last 1000 years of inspiral, as distributed 
among the first 20 harmonics of the radiation. We see that early in the inspiral history, the radiation is dominated by 
the high harmonics, which are well within LISA's sensitivity band. Thus the capture source is effectively "in band" 
throughout the entire inspiral, and so is always a potential source of confusion. 

The second piece of input required for our estimate of subtractablc noise is the SNR output from typical sources, 
as a function of r. Following Finn and Thorne [17] we write the two-detector (sky- aver aged) SNR^ as a sum of 
contributions from all harmonics of the orbital frequency: 

SNR2 = 2S„ / ^^^dln/„, (45) 

where 



he,n = {nd)-^^2En/fn (46) 

is the "characteristic amplitude", fn = niy, and En is the power radiated to infinity by GWs at frequency /„, given, 
to lowest order, by Eq. (9) above. The results are shown in Fig. 13. In practice, we summed over the first 100 
harmonics. Also, we performed the integral in the time domain, using /„ = nz>, with v given by the lowest-order 
formula, Eq. (43). 

Of course, the SNR results we estimate this way cannot be very accurate, in general, since they arc based on quasi- 
Newtonian orbits and lowest-order radiation reaction formulae. Their inaccuracy can be gauged, to some extent, 
by comparison with exact (to numerical accuracy) values obtained by Finn and Thorne [17] for the case of circular, 
equatorial orbits. For the last year of inspiral of a IOMq BH into a IO^Mq Schwarzschild MBH, and using essentially 
the same LISA instrumental noise curve that we use, but neglecting confusion noise, Finn and Thorne find the (sky- 
averaged) value SNR = 72 [28] for one synthetic Michclson. For the same case (and again neglecting confusion noise), 
our method estimates SNR = 105 (again, for one synthetic Michelson). Therefore our SNR is 45% too high for this 
case. Unfortunately, for eccentric orbits, and even for circular orbits several years before the final plunge, there is no 
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FIG. 11. Percentage of GW energy radiated (out of the total energy radiated during the entire inspiral) as a function of the time r left 
to plunge, for typical captures of a WD (upper left panel), a NS (upper right panel), and a BH (lower panel), for a range of MBH masses. 
The masses of the WD, NS, and BH are taken to be m = 0.6, 1.4, and IOMq, respectively, and in all cases the CO plunges at eccentricity 
elso = 0.15. A substantial fraction of the energy is emitted long before plunge, when the source is still not detectable by LISA. This 
energy is mostly emitted in relatively high-frequency bursts near periastron passage, and so contributes to the confusion noise. 

"gold standard" that we can compare our SNR estimates to. We shall assume that our SNR estimates are correct to 
within a factor 2, and we shall see that this accuracy is good enough for drawing our main conclusions. 

In the analysis below we shall assume that SNRthrcsh — 30, so sources are detectable out to an average distance 
rfdct of 

ddot/(lGpc) = (average 3-yr SNR @ 1 Gpc)/30, (47) 

where the 3-yr SNR is calculated as described above. However, given the various approximations and uncertainties (the 
rough nature of our SNR calculation, the uncertainty in our estimate of SNRthrcsh j the fact that typical observation 
times might be ~ 5 or more years instead of 3), we shall also point out how our results below would change if c?dct 
were a factor two larger or smaller than our estimate. 



1. Subtracting the resolvable WD captures 

In Sec. IV above we estimated the spectral noise density ^^'^'^^p* fj-om all WD captures (see Fig. 6). What portion 
of this noise is subtractable? 

Consider first WD captures that are completed (i.e, the WD is swallowed by the MBH) during the LISA mission 
lifetime (~ 3 — 5 yr). From Fig. (13), we see that on average SNR~ 9 for such captures, so from Eq. (47) we estimate 
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FIG. 12. GW output from the last 1000 years of inspiral of a captured WD at 1 Gpc. Shown (solid convex curves) are the characteristic 
amplitudes hc,n [see Eq. 46] of each of the first 20 harmonics (labeled n = 1, . . . , 20), for a capture of a O.QMq WD by a 10® Mq nonspinning 
MBH, with plunge eccentricity 0.15. The signal curves are superposed on LISA'S instrumental noise (solid concave curve) and LISA'S noise 
curve including confusion from WD binaries (dashed curve). In each /ic,n curve, the right end corresponds to the instant of plunge, the 
loft end corresponds to 1000 years before plunge, and dots along the curve correspond (from right to left) to 10, 100, and 500 years before 
plunge. Note that even 500 years before plunge, most of the GW energy is radiated "in band" — i.e., in the frequency range where LISA 
is most sensitive. Although at such early times the orbital frequency is well "below band," most of the energy radiated then is carried by 
high harmonics of the orbital frequency. 



t^dot ~ 0.3 Gpc. Since the proper-motion distance^ at z = 1 is 3.3 Gpc, and since the contribution from sources at 
z < 1 is around 85% of the contribution from all z (see the discussion in Sec. IV D above), we estimate that a mere 
(0.3/3.3) X 0.85 ~ 8% of the energy from the sources that plunge during LISA's lifetime is subtractable. 

Consider next the contribution to the confusion energy from captures that are not completed during LISA's lifetime. 
Clearly, for these captures (which have a smaller 3-yr SNR, and so must be closer to us than ddet 0.3 Gpc to be 
detectable), the portion of subtractable energy will be even smaller than the above 8%. Let us attempt to estimate 
this portion, very roughly: Examining Fig. 13, we find that the 3-yr SNR from typical WD captures drops by a factor 
^ 5 over the last ~ 100 years of inspiral. Thus WD captures with ~ 100 years to live can be detected out to an 
average ~ 0.06 Gpc. These detectable sources contribute ~ 2% of the capture noise (since this fraction scales as the 
detection distance), so averaging ^ 2% and 8%, we estimate that of all the capture noise from sources with — 100 
years until final plunge, ~ 5% is resolvable. According to Fig. 11, however, these sources contribute, on the average, 
only around 55% of the total confusion energy — the rest is attributed to captures that have more than 100 years to 
plunge when LISA observes. From the sources with > 100 years to go until plunge, we similarly estimate that ~ 1% 
of the capture noise is resolvable. Hence, we estimate the overall fraction of subtractable energy from WD captures 
is roughly 0.55 x 5% + 0.45 x 1% w 3%. 

In conclusion, we estimate that ^ 97% of the capture noise from WDs represents an irreducible confusion noise: 

^WDcapt^^^ « 0.97 X 5^°^"P*(/). (48) 

If we were to assume that c?dot is twice as large as estimated above (say, because we have underestimated the signal 
strength, or the low-frequency noise is significantly better than the design goal, or the mission lifetime is much longer 
than 3 years, or because our estimate of SNRthrosh was too pessimistic), then the fraction of subtractable energy 



We use the proper-motion distance dhi for estimates here, as opposed to the luminosity distance or angular diameter 
distance, since the number of sources closer than (Im scales like d\j, assuming that their number per co-moving volume is 
time-independent . 
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FIG. 13. Combined SNR (from two synthetic Michclsons) as a function of time r left to plunge for the three cases shown in Fig. 11, 
at a fiducial distance of 1 Gpc. Each SNR data point represents the matched filter SNR for a 3-yr-long waveform segment centered at t. 

would only increase to 6%. Clearly, in either case it is a good estimate to simply take Sj^^'^^^^ {f) « S^^'^^^^{f). Of 
course, this last approximation becomes even better if d^ct is smaller than estimated above. 



2. Subtracting the resolvable NS captures 

We next apply similar arguments to estimating the fraction ^NScapt^^NScapt unsubtractable noise from captures 
of NSs. Fig. 13 shows that NS captures with r less than a few years can be detected to a distance ~ 0.6 Gpc — 
roughly twice as far out as for WDs. Therefore, the same reasoning as above for WDs suggests that roughly 16% 
of the GW energy impinging on the detector from NS captures is resolvable and hence subtractable (i.e., twice the 
fraction we found for WDs). Examining Figs. 13 and 11 again, we find that captures that plunge ~ 40 years after 
LISA ends operations are detectable to a distance of ~ 0.17 Gpc, so ~ (.17/3.3) * 0.85 « 4% of the energy from 
these is subtractable. Estimating that half the NS confusion background comes from sources with — 40 years before 
plunge, and half from sources with > 40 years, and estimating that the average subtractable noise portion for these 
two classes are 10% and 2%, respectively, we find (very roughly) 

^NScapt^^^ « 0.94 X 5^'^'"P*(/). (49) 

If we were to assume NS captures are actually detectable to twice the distance estimated above, we would still 
have S^^'^^^^{f) w 0.88 x S^'^'^'^'^^ {f) . Clearly, as with the WDs, it is a good first approximation to simply take 

r»NScapt / r\ rtNScapt / r\ 

U) ~ '^h u)- 
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3. Subtracting the resolvable BH captures 



From Fig. 13 we estimate that BH captures that plunge during LlSA's lifetime will have an average 3-yr SNR of 
~ 80 at 1 Gpc. and so will be visible to ddot ^ 2.7 Gpe (so almost to z = 1). Hence, unlike the situation with WDs 
and NSs, most of the energy from BH captures that plunge during LISA 's lifetime is probably attributed to resolvable 
sources, and hence is subtractable. 

To estimate the confusion noise level we refer again to Figs. 13 and 11: BH sources with ^ 10 years to go before 
plunge are detectable only out to ^ 0.7 Gpc, so ^ 80% of the backgroimd from these sources is not subtractable. From 
Fig. 11, roughly 40% of the GW energy is released more than 10 years before final plunge, and so (averaging 80% and 
100%, and then multiplying by 0.4) we estimate that at least ~ 35% of the BH background is unsubtractable: 

^BHcapt^^)^Q35^^BHcapt^^)_ (50) 

(Note the RHS is a lower limit, since here we have not included the hard-to-estimate contribution from unresolvable 
sources with r < 10 yr.) If we instead assiune rfjot = 1-4 Gpc for BHs with 10 years to go before plunge (i.e, twice what 
we estimated above), then the same steps yield S^^^^^^ {f) ^ 0.3 x 5^^'^^^*(/). On the other hand, if ddet is half what 
we estimated above, then ^ 40% of the energy from BHs that plunge during LlSA's operation will not be subtractable, 
and for sources with ~ 10 years to go before plunge, the unsubtractable fraction increases to 90%. Thus overall, the 
unsubtractable portion would be ~ 75% 0.5(0.4 0.9)0.6 0.5(0.9 -M.0)0.4]; i.e., 5'^"'=^p*(/) « 0.75 x 5^"'=''p*(/). 

The above estimates were made assuming ^^^^^^^p* ^ j") (joes not significantly raise the total effective noise level. 
However, if the BH capture rates are at the high end of the estimated levels, then S^^'^"'^*^ (f) does significantly raise 
the total noise curve, thereby reducing the distance out to which the BH sources can be resolved. Then ^BHcapt^^^ 
would be in the intermediate regime shown in Fig. 10, where it is difficult to estimate without going through the 
entire recursive procedure described in Sec. V A. 

For this reason, at the highest BH capture rates (k^^ ^ 3 x 10^^), our present work is simply too crude to place 

very restrictive limits on the ratio S'^^'^^'^*(/)/5^^'^'*^*(/). Instead, we quote the following range 

^BHcapt^^^ = (0.3 - 1) X ^BHcapt^^^ ^g-^^ 

(noting that the upper end of this range would be approached only if the BH capture rate is very high, so that capture 
noise significantly raises the overall noise level) and leave it to future work to improve this estimate. 



VI. LISA'S TOTAL NOISE CURVE 



Having estimated what fractions of the three CO backgrounds are not subtractable, we may finally plot their effects 
on LISA'S total noise curve Sf^{f). Figures 14-16 depict S^{f), as derived from Eq. (32), with the contributions 
from the different CO species (WDs, NSs, or BHs) considered separately. [Recall from the discussion around Eq. (32) 
that in the range / ^ ImHz, the effect of the capture confusion noise (like the ones of instrumental noise and EGWDB 
background) is effectively increased by a factor that counts the bandwidth lost when fitting the GWDBs.] In the case 
of WDs and NSs we have included all capture noise as confusion noise, since our above estimates suggest that the 
subtractable portion of the noise would be small for these sources. For BH captures we have assumed that 0.3 of the 
capture noise is unsubtractable, but have also shown the extreme case where 5'^^^^?* — ^BHcapt j^^^ (51)]. For 
each species we also refer to two cases, corresponding to the lower and higher end of the estimated event rate for that 
species. Note that the astrophysical event rate remains the main source of uncertainty in our analysis, and clearly 
overwhelms the uncertainty introduced by our crude estimate of the ratio ^^apt^^capt 



VII. SUMMARY, IMPLICATIONS, AND FUTURE WORK 

We have seen that astrophysical rates for CO captures are either near (within one or two orders of magnitude) or 
at the point where confusion noise from these sources begins to dominate LlSA's noise curve. That is, basically, a 
good situation: Such event rates maximize the detection rate for these interesting sources. Moreover, even at the high 
end of estimated rates, LlSA's total noise level [fS^{f)Y^^ is raised by less than a factor ~ 2 by the CO capture 
confusion noise, so other LISA science (such as detections of MBH-MBH mergers) is not jeopardized. 

Let us also highlight an important point that is implicit in Fig. 13: For WD capturess, HPOs will account for 
roughly half the detections. (Again, HPOs are "holding-pattern objects," i.e., COs detected 10 or more years before 
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FIG. 14. Total LISA noise (dash-dot line), including instrumental noise, confusion noise from WD binaries, and confusion noise from 
captures of only White Dwarfs. The two total-noise curves correspond to the higher and lower end of the estimated event rate for WD 
captures. (For the lower rate, the level of WD capture confusion noise is low enough that the corresponding total-noise curve appears to 
coincide in this plot with the dashed curve, representing instrumental and WD-binaries confusion noise only.) By virtue of Eq. (48), we 
have assumed here that the entire capture background ,5^Dcapt (.Qjj^^[.j)-m^Qg ^^jjf, confusion noise 5^Dcapt rpj^^ total noise curve has 
been calculated using Eq. (32). 

they plunge). To see this, in Fig. 17 we plot (in arbitrary units) the detection volume Vdot times the time until plunge 
T, versus Inr, for our fiducial sources. This uses the same information as contained in Fig. 13, since we have simply 
taken Vdet oc d^^^ oc (SNRQlGpc)'^; however, since the total number of detected captures (for each type of source) 
is proportional to / Vdet^ d In t , this representation allows one tell at a glance the relative importance of HPOs to 
the total detection rate. In particular, we find that ^ 1/2 of detected WDs and ^ 1/3 of detected NSs will have 
T > 10 yr. While the BH detection rate will be dominated by sources with t ^ 3 yr, ^ 5 — 10% of BH detections will 
have T > 10 yr as well. [This last estimate is based on a linear extrapolation of the curves in Fig. (17) to larger r 
values. Unfortunately, at r ^ 12 yr, for the lower MBH mass, the BH orbits attain very high eccentricities (e > 0.9), 
rendering our code unreliable.] Clearly, the results here are quite crude (most importantly, the results in Figs. 13 
and 14 are all based on a single, fiducial value of the final eccentricity, blso = 0.15), but still the moral seems to be 
clear: When searching over the large parameter space of possible capture signals, it will be worthwhile to construct 
template families that include waveforms from HPOs. We also point out that many of the HPOs that LISA detects 
will still be "alive" when some next-generation LISA is fiown (and, for some of the WDs, next-next-generation LISA, 
etc.) which may allow for especially sensitive tests of GR in the future. 

Clearly, the analysis in this paper has been crude in many ways. In particular, most of the estimates in Sec. V were 
based on taking results from a few fiducial cases, and then "averaging by eye." Nevertheless, the uncertainties due to 
our approximations are clearly dwarfed by the uncertainties in the astrophysical rates. Moreover, our basic conclusion 
regarding WDs and NSs — that most of the GW energy we receive from these captures is not resolvable by LISA and 
so represents a confusion noise — seems very robust. For the BH case, our estimated range of S'J^'^p* is rather broader: 
^capt ^ 2 _ 1) X iS^'^P*. However, here too, the basic moral remains clear: To raise LISA's overall effective noise 
level by even a factor ~ 2, the BH capture rate would have to be at the high end of its estimated range, resulting in 
several hundred detections per year — a compensation devoutly to be wished. 
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FIG. 15. Same as in Fig. 14, but this time showing the total LISA noise when capture confusion from only Neutron Star sources is 
taken into account. By virtue of Eq. (49), we have assumed here that the entire background cS^^ '^^^^ contributes to the confusion noise 
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FIG. 16. Same as in Fig. 14, but with capture confusion from only Black Hole sources taken into account. The three dash-dot hnes 
show the total noise curve under different assumptions as to the event rate and the fraction of the background that is subtractable. 
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FIG. 17. The detection volume Vdct times r (the time from the middle of the 3-yr observation until plunge), as a function of Inr, for 
our three fiducial sources. The total number of detected captures is proportional to the integral J VdctTdlnr. Roughly half (one-third) of 
the WDs (NSs) that LISA detects will have r > lOyrs. BH detections will be dominated by sources with r < 3 yr, but roughly 20% will 
have T > 5yr. 
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